A New Method to Cluster HTML Documents Using Mixed Algorithms

Maryam Shoar; Ali Asghar Salarnezhad

Volume 6, Issue 24 , May 2018, , Pages 37-62

https://doi.org/10.22054/ims.2018.8891

Abstract
    Given the high volume of web information, more attention has been paid to the automatic data extraction systems. One of the most important methods of data extraction is clustering. Today, many clustering methods are provided which are mostly based on vector models. In these models, each document ...  Read More